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ABSTRACT 

Recent studies have shown that topological con- 
straints encoded at the RNA secondary structure 
level involving basic steric and stereochemical 
forces can significantly restrict the orientations 
sampled by helices across two-way RNA junctions. 
Here, we formulate these topological constraints in 
greater quantitative detail and use this topological 
framework to rationalize long-standing but poorly 
understood observations regarding the basic behav- 
ior of RNA two-way junctions. Notably, we show that 
the asymmetric nature of the A-form helix and the 
finite length of a bulge provide a physical basis 
for the experimentally observed directionality and 
bulge-length amplitude dependence of bulge indu- 
ced inter-helical bends. We also find that the topo- 
logically allowed space can be modulated by 
variations in sequence, particularly with the addition 
of non-canonical GU base pairs at the junction, and, 
surprisingly, by the length of the 5' and 3' helices. A 
survey of two-way RNA junctions in the protein data 
bank confirms that junction residues have a strong 
preference to adopt looped-in, non-canonically 
base-paired conformations, providing a route for ex- 
tending our bulge-directed framework to internal 
loop motifs and implying a simplified link between 
secondary and tertiary structure. Finally, our results 
uncover a new simple mechanism for coupling 
junction-induced topological constraints with 
tertiary interactions. 

INTRODUCTION 

A fundamental goal of structural biology and biophysics 
is to understand the underlying forces that govern the 



folding and conformational behavior of macroniolecules. 
Stacking interactions and base-pairing have been shown 
to be the primary forces that drive the formation of RNA 
secondary structure, and thermodynamic frameworks that 
treat these interactions are available and can be used to 
predict RNA secondary structure with >70% success 
(1,2). Despite great advances in understanding the contri- 
butions of tertiary interactions in RNA folding and indeed 
its coupling to secondary structure and metal association 
(3-5), we are far from being able to predict an RNA fold 
based on a first principle understanding of RNA folding 
(6-8). The problem is even more complicated when one 
considers that RNA structures are typically highly flexible, 
and often adopt not one, but an ensemble of native con- 
formations that can vary in response to cellular signals 
and/or changes in the physiochemical conditions (9,10). 
Much less is known about the principles that guide such 
dynamic aspects of RNA conformation (11). 

RNA secondary structure is dominated by A-form 
helical domains that carry various receptors involved in 
tertiary interactions and are linked together by a variety of 
junctions, including bulges, internal loops, three-way and 
other higher order junctions. Lilley (12-14), Griffith (15), 
Draper (16) and Hagerman (17,18) showed more than two 
decades ago that bulges and internal loops induce direc- 
tional bends in DNA and RNA duplexes by amounts de- 
pendent on the length, asymmetry and sequence of the 
junction as well as on ionic strength. Surveys of higher 
order junction atomic structures have also suggested that 
there is strong dependence of junction conformation on 
secondary structure (19-21). While it has long been re- 
cognized that junctions can direct specific inter-helical 
conformations, the degree to which secondary structure 
alone defines RNA global structure has only recently been 
fully appreciated and placed on a quantitative footing 
(22). In particular, studies by us (23) and the Herschlag 
group (24) have shown that two-way junctions impose 
steric and connectivity constraints that restrict the 
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relative orientation of A-form helices. These constraints 
include inter-helix steric clashes and connectivity con- 
straints due to the finite length of the junction nucleotides 
linking the helices. Using a simple heuristic model that 
assumed a canonical A-form helix geometry and maximum 
inter-nucleotide distance of 4.9 A, we showed that these 
constraints alone restrict the range of accessible inter- 
helical conformations to between 4 and 20%, and on 
average 7%, of the total space (23). Despite this narrow 
range the model accommodates ~85% of all inter-helical 
orientations observed in all two-way junctions in the PDB, 
including bulges and internal loops (23). 

Here, we explore how various parameters of interest 
influence the topologically allowed space with the goal 
of developing a deeper and more predictive understanding 
of how two-way junctions encode 3D RNA conformation 
and dynamics. In particular, we examine how the allowed 
conformational space varies with local variations in 
A-form hehx geometry, the presence/absence of non- 
canonical base pairs at the junction, relative lengths of 
the 5' and 3' helices and overall junction topology (bulge, 
symmetric and asymmetric internal loops). We also put on 
a firm quantitative footing the well-known yet poorly 
understood directionality of junction-induced bends as 
well as explain the origins of recently reported correlated 
variations in the inter-helical twist and bend angles 
observed across two-way junctions (23,25). We also survey 
the structures of symmetric and asymmetric internal loops 
and show that the resulting allowed inter-helical conform- 
ational space is accurately modeled by assuming that 
internal loops will maximize non-canonical base pairing, 
marking a significant simplification in the link between 
RNA secondary and tertiary structure. Our results also 
uncover a new simple mechanism for coupling junction- 
induced topological constraints with tertiary interactions. 

METHODS 

Computing topologically allowed inter-helical orientations 

We utilize the H/S^-H/Sy notation (23,26) to describe the 
topology of a two-hehx junction where ; and / specify the 
length of the 5' and 3' helices, respectively, and X and Y 
{X>Y) denote the number of single-stranded nucleotides 
in the 5' and 3' strands with the respective strands assigned 
in accordance to our prior conventions (23). For simplicity 
we refer to a junction topology in the body of the text as 
Sjk-Sr, with the values of i and / implied by context; junc- 
tions from the PDB possess i,j> 3, and in reference to our 
model, unless otherwise stated, / = / = 4. For a structure 
of a given two-way junction topology, the inter-helical 
conformation can be defined through the use of the 
three Euler angles needed to describe the orientation of 
the chiral helical objects with respect to one another. As 
described previously (23,27) these angles, denoted ah Ph 
Yh, are obtained from the rotation matrix needed to trans- 
form the 3' helix from its observed orientation to one 
where it is coaxially stacked upon its 5' partner, and 
where aj, and Yh are interpreted as twists around the 
3' and 5' hehces and /Sh as an inter-hehcal bend angle 
(Figure lA). 



As described previously (23), this representation of 
inter-helical conformation can readily be inverted to 
compute the sterically and connectivity allowed conform- 
ations. Briefly, an idealized A-form helix with helical axis 
oriented along the positive z-axis from 5' to 3' direction 
was divided into two sub-helices, denoted the 5' and 3' 
helices, and with the angle between the 5' hehx closing 
base pair j'-axis and the molecular v-axis, Th, set to 
51.1° (27). The entire system was translated such that 
the first phosphorous atom belonging to the 5' helix on 
the strand oriented along the negative z-direction from the 
5' to 3' end was located at the origin, serving as the 
rotation pivot point. A given aj, Ai Yh inter-helical orien- 
tation was generated by applying the Euler rotation 
Rzi—Jh) X Ryi—Ph) X Rz{—oih) to the 3' helix while keep- 
ing the 5' helix fixed, where Rz and Ry refer to Euler 
rotations of an object about the Z-axis and F-axis, re- 
spectively (27). Rotations were performed for all non- 
degenerate (27) permutations of ah Yh on a 5° grid. 
Conformations that resulted in inter-helix collisions, taken 
as an inter-helical atomic distance of <1.4A excluding 
hydrogen atoms and OIF and 02P atoms connected to 
the pivot P atom, were classified as sterically disaUowed. If 
the distance between the last 03' atom of the 5' helix and 
first P atom of the 3'helix at the division between the two 
helices on the strand opposite the pivot was greater than 
the hnker cutoff of 7.26 A per bulged nucleotide then the 
conformation was classified as connectivity disallowed. 
This corresponds to a cutoff of 7.26 A, 14.52 A, 21.78 A 
and 29.04 A when calculating connectivity constraints for 
Si So, S2S0, S3S0 and S4S0 bulges, respectively. The fraction 
of space allowed for a given junction is computed by divi- 
ding the number of aUowed conformations by the number 
of unique ah Yh permutations, 191 736 (27). 

Computing topologically allowed space with translational 
relaxation 

We computed the topologically aflowed conformations 
when allowing for some degree of inter-helix translation 
that may reheve inter-hehx steric collisions. Translational 
degrees of freedom were defined as deviations in the x, y 
and z coordinates of a 3' hehx relative to the position of an 
idealized reference 3' helix that has the same ah P^ Yh 
orientation. Analysis of crystal structures of SiSq, S2S0, 
S3S0 and S4S0 junctions in our database (see PDB survey 
of two-way junctions) revealed that these translational de- 
viations are roughly isotropic in x, y and z, centered near 
0, and on average ^ +y^+z^<2A. We therefore aUowed 
1-2 A of translational freedom in each of the .t, y and z 
directions to be representative of that found in the PDB. 
To simulate the effects that these extra degrees of freedom 
may have to relieve inter-helix steric collisions, we 
modified our rotation protocol. As in the translation-free 
procedure, a rotation i?z(— Yh) x Ryi—Ph) x i?z(— «h) is 
applied to the 3' idealized helix, iHII, while keeping the 
5' helix, iHI, fixed. Following each rotation, iHII was 
translated by all combinations of — 1 , 0 and 1 A in the x, 
y and z directions for a 1-A cutoff, or by all combinations 
of —2, —1, 0, 1 and 2 A for a 2-A cutoff. This procedure 
captures >80% of the inter-helix translations observed in 



894 Nucleic Acids Research, 2012, Vol. 40, No. 2 



the PDB. A given inter- helical angle was accepted provided 
one of the translations alleviated any existing steric colh- 
sions. Connectivity-allowed conformations were taken to 
be the same those for the translation free case. 

Computing the topologically allowed space with 
non-idealized helices 

Eighteen 8-bp continuous helices of different sequences 
were chosen from junctions returned from a search of 
the RNA FRABASE (28) and were divided into 
2 X 4-bp helices, named the 5' and 3' helices, respectively 
(Supplementary Table SI). The hehces were chosen such 
that all combinations of WC pairs and two GU wobble 
pairs were represented at the 'junction' between the two 
4-bp helices. To remove any inter-helical bends or twists 
the 5' and 3' helices of each 2 x 4-bp system were individu- 
ally superimposed on top of their corresponding helices in 
the idealized A-form system used in the idealized topo- 
logically allowed computations. In cases where the pivot 
P atom was not located at the origin after the superpos- 
itions, a translation was applied to both 5' and 3' helices 
such that the pivot was at the origin as desired. After these 
'non-idealized' systems were prepared, the same rotation 
procedure used for the idealized helices was used. 

Cylinder rotations 

An in-house Python script was used to generate two cy- 
hnders of diameter 17.5 A, height 24 A and surfaces rep- 
resented by a ~1.5-A square mesh (distance between 
points was closer on cyhnder ends). The chosen diameter 
of 17.5 A corresponds to the distance between P(/) and 03' 
(/■) across the idealized A-form helix in Watson-Crick 
base-paired residues / and /. The bottom cyhnder axes 
were vectors originating at (8.75,0,-0.75) and (0,0,0) 
and running along the negative z direction for the bulge 
and center connected systems, respectively. The top cyhn- 
der axes began at (8.75,0,0.75) for the bulge and 
(0,0,14.52) for the center connected systems and ran 
along the positive z direction. Orientations that resulted 
in mesh points from opposing cylinders being <1.5A 
apart were determined to be sterically disallowed. 
Connectivity constraints for the bulge-like system were 
implemented by requiring that the distance between the 
linker termini, (17.5,0,-0.75) and ^(— yh,— /Jh,— ah) x 
(17.5,0.0.75), were < 14.52 A. The linker length of the 
center-connected cyhnder system is invariant across all ro- 
tations and is thus free of connectivity constraints. 

PDB survey of two-way junctions 

A search of RNA two-way junctions was performed using 
the RNA FRABASE (28) on 25 June 2010 for H3S1H3S0, 
H3S2H3S0, H3S3H3S0 and H3S4H3S0 bulge motifs; on 27 
July 2010 for H3S1H3S1, H3S2H3S1, H3S3H3S1, H3S4H3S1, 
H3S2H3S2, H3S3H3S2, H3S4H3S2, H3S3H3S3, H3S4H3S3 
and H3S4H3S4 internal loop motifs; and 22 September 
2010 for all HfiH^ helices (or 6-bp continuous helices) 
as described previously (23). Inter- helical orientations 
for each identified junction were measured using 
in-house Perl scripts and the ah Ph Th convention of 
Bailor et at. (23,27,29). Unless otherwise noted, angles 



for nuclear magnetic resonance (NMR) structures 
were measured on the first model reported in the 
ensemble. The <2% of junctions identified^ by the RNA 
FRABASE whose flanking hehces were >2 A in backbone 
RMSD from idealized A-form, had a strand possessing 
all deoxyribose sugars, or had non-continuous chain 
connectivity at the junction were excluded from our 
analysis. Measured orientations are designated as fall- 
ing within the topologically allowed space if the 
shortest distance between the (ah/ih/h) measured for 
conformation i and (a^Plyl) for topologically allowed 

point T, d=^ (a;; - a,^-+(/J|; - ^if+ivl - yI)\ is <5°. 
Bulge-hnker distances were calculated along the bulge 
strand between the 03' atom of the 5' helix closing base 
pair and the P atom of the 3' helix's first base pair with an 
in-house Perl script (23). 

In-house Perl scripts were also used for the determin- 
ation of the inter-helical translations of each junction by 
adapting the procedure used to determine inter-helical 
angles. In obtaining the ah ySh Yh for a junction a reference 
helix, denoted iH2', is superimposed on top of the 3' hehx 
and then this iH2' is transformed to an orientation that is 
perfectly coaxially stacked upon the 5' helix (27). This 
transformation is done by first translating the center of 
mass of iH2' to the origin, performing a rotation, and 
then translating the rotated iH2' from the origin to its 
coaxially stacked position. The difference between these 
first and second translations defines a net translation of 
the center of mass of iH2' from its coaxially stacked 
position. This measured net translation is composed of 
two terms, the movement of the center of mass due the 
rotation of the helix and the deviation of the helix away 
from this 'pure rotation' position. We calculated the 'pure 
rotation' position of a given ah Ai Yh by applying a 
rotation of — yh — ySh —ah to the 3' helix of an idealized, 
coaxially stacked hehcal system, and then determined 
the net translation incurred when transforming this 'pure 
rotation' conformation back to its original coaxially 
stacked state. The difference between the net translation 
measured for the PDB conformation and net translation 
of the 'pure rotation' conformation was taken as the trans- 
lational deviation of the junction. 

FR3D analysis of internal loop motifs 

FR3D (Find RNA 3D) (30) pairwise interaction files that 
were available for PDBs within our database were down- 
loaded from the FR3D website at http://rna.bgsu.edu/ 
FR3D/AnalyzedStructures/PDB/PDB_interactions.html 
on 23 August 2010, where PDB is replaced with the name 
of the PDB structure of interest. The interaction files were 
parsed with in-house Python scripts to search for base 
pairing and stacking interactions. Bases were assigned as 
'looped-in' if FR3D indicated that they possessed a 
stacking interaction with residues of either strand of the 
junction or with the bases comprising the closing base 
pairs of the flanking helices. The <2% of junctions 
without available FR3D files were excluded from the 
analysis. 
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Identification of tertiary interactions 

Intra-RNA tertiary interactions were identified by search- 
ing the FR3D interaction files for pairing or stacking inter- 
actions between junction or closing base pair nucleotides 
and non-junction nucleotides with an in-house Python 
script. Junctions without available FR3D files were by 
default classified as having tertiary contacts. A separate 
in-house python script was used to search for protein, 
ligand and RNA tertiary interactions that were not 
identified by FR3D. PDB files corresponding to each 
junction in our database were searched exhaustively for 
any C, N, S, O or P atoms within 4 A of junction or 
closing base pair atoms while excluding atoms from the 
chain(s) containing the junction and all 'HOH' O atoms. 

Manual annotation of H3S2.2H3S2 junctions 

Crystal structures of H3S2.2H3S2 junctions within our data- 
base were sorted according to sequence and inter-helical 
angles. Junctions with identical sequences and within an 
11.5° distance in ah Ai Yh space were clustered together 
and assumed to have common features. One junction from 
each group was then manually inspected and each loop 
base was qualitatively classified as to whether it was 
non-canonically base paired, 'near non-canonically paired' 
or neither, and whether it was stacked into the junction. 
Base paired residues were defined as two bases from oppos- 
ing strands having at least one hydrogen bond donor to 
acceptor distance of <3.6A and angle >125°. Near 
non-canonical base pairs were defined as two bases from 
opposing strands that were roughly coplanar and looped 
into the junction but did not possess clear hydrogen 
bonds. Tertiary contacts, defined as protein, ligand, crystal 
or other non-junction RNA hydrogen bonding contacts 
with any junction or closing base pair base (excluding 
backbone contacts) were also recorded. 

RESULTS AND DISCUSSION 

Bulges 

Sterics. We performed an exhaustive search of the inter- 
helical conformational space accessible to a bulge by carry- 
ing out a series of rigid body rotations to two, 4-bp-long, 
idealized A-form helices. In Figure IB, we show the 
inter-helical orientations allowed after excluding all con- 
formations that possess inter-helix steric clashes. The 
angles a\^ and yh. denoting the twists of the 3' and 5' 
helices about their respective hehcal axes, and fi^, the 
inter-helical bend angle, are plotted as 3 x 2D projections 
of the 3D tth Ph Yh space of inter-helical conformations 
(27). Of the theoretically possible ah /ih Yh orientations 
(27), 53% are excluded by steric constraints alone. 
Visual inspection of the projections also reveals that the allo- 
wed orientations are asymmetric in ah and yh. Calculations 
using perfectly symmetric cylinders (Supplementary 
Figure SI) reveal that the asymmetry of the sterically 
allowed space arises from the inherent asymmetry and 
chirality of the helices. 

A notable consequence of the anisotropy of the steric- 
ally allowed space is that it encodes a specific 



directionality to any bending that occurs between the 
two helices (Figure IC). Steric colhsions between the 
closing base pairs of the two helices make it such that 
the density of accessible states is much greater for bends 
that correspond to movement of the 3' helix into the space 
above the plane of the 5' closing base pair and 'away' from 
the 5' hehx, rather than into the closing base pair plane 
and 'towards' the 5' hehx. This result provides a quanti- 
tative and simple explanation for the directionahty of 
RNA and DNA bulge-induced bends (12,14,16,31,32). 

Interestingly, the sterically allowed orientations are also 
dependent on helix length. While 53 "/o of conformations 
are excluded with 4-bp helices, the fraction excluded in- 
creases to 62% and 69% for 6-bp and 8-bp helices, re- 
spectively, and decreases to 49% with 3-bp hehces. The 
steric constraints only depend on the length of the 5' helix 
when the 3' helix has >3bp (Figure ID). This can be 
attributed to the fact that the helices are asymmetric 
about the rotation origin (pivot P atom), which is 
located closer to atoms in the 5' helix as compared to 
the 3' helix. As a result, elongation of the 5' helix results 
in steric occlusion, whereas the new volume occupied by 
elongation of the 3' helix falls outside of the sphere swept 
by the 3' helix. Thus, varying the length of the 5' helix can, 
in principle, modulate the allowed inter-helical space. 

The above steric model assumes an idealized A-form 
helix geometry. However, RNA helices can deviate from 
the canonical geometry and this in turn can affect the 
sterically allowed conformations. To examine these con- 
tributions, we computed the sterically allowed conform- 
ations when using 18 non-idealized 2 x 4-bp helices taken 
from the PDB (Supplementary Table SI). Each sequence 
had different closing base pairs at the site of the junction, 
one representative for each of the 16 unique canonical 
base pairs, and two sequences possessing a GU wobble 
pair at the junction. The fraction of sterically disaUowed 
conformations varied between 35 and 50% and on average 
44% of the total space, in good agreement with 53% when 
using idealized A-form helices. On average, 97% of the 
orientations that are allowed when using idealized 
A-form helices are also allowed when using non-idealized 
helices, whereas an average of 82% of the orientations 
sampled by the non-idealized helices are accessible to 
idealized A-from helices. Thus, local distortions in the 
helix geometry can slightly increase the space that can 
be sampled and allow access to otherwise sterically dis- 
allowed inter-helical conformations. 

Our model also assumes that the two helices are fixed at 
the pivot point. However, some of the steric constraints 
can, in principle, be relaxed through translations of the 
helices relative to one another. Analysis of the bulge 
motifs within our structural database revealed __that the 
average net translation from the origin is <2A. If the 
sterically aUowed space is recalculated with an allowance 
for ~ 1 A translational relaxation the fraction of sterically 
aUowed conformations increases from 47% to 68% and 
with ~2 A to 80%. Studies that apply a more sophisticated 
treatment of the couphng between rotational and transla- 
tional degrees of freedom are needed to clarify the signifi- 
cance of inter-helix translations in determining junction 
conformation. 
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Figure 1. Sterically allowed inter-helical conformations. (A) ai, /ij, yj, convention used to describe inter-helical orientations and (B) ffh-Th- "h^/'h and 
Jh-fih projections of the conformations allowed (black) given the steric constraints. (C) Density of allowed conformations of the upper (3') helical 
axis in relation to the fixed lower (5') helical axis projected in the plane perpendicular to the lower helical axis. A cartoon of the relationship between 
a helix conformation and a projected point as well as the coordinate frame of the closing bp of the 5' helix (67) in relation to the coordinate frame of 
the projection is shown for reference. (D) Fraction of a?h fit, Th space sterically accessible as a function of length of the 5' and 3' helices. 



As a whole, these resuhs suggest that steric constraints 
exclude 20-53% of the possible inter-helical orientations. 
However, we emphasize that these values represent a 
lower bound to the degree of steric confinement in RNA 
junctions. The heavy atom van der Waals diameter of 
1.4 A used in our models is well below commonly accepted 
values of 3^ A. Inclusion of hydrogens and the solvation 
sheU wiU also increase the effective van der Waals radii. 
Finally, it is likely that electrostatic repulsion may serve to 
further narrow the space (33). 

Connectivity. The set of conformations that are accessible 
to a bulge junction is also subject to connectivity con- 
straints arising from the finite length of the bulged nucleo- 
tides linking the helices. We approximate these constraints 
by requiring that the end-to-end distance between the last 
03' atom of the lower (5') helix and the first P atom of the 
upper (3') hehx to be less than a given cutoff length. We 
previously used a bulge length cutoff equal to 4.9 A per 
nucleotide, which was assumed to be the maximum length 
of a given residue (23). Here, we compute the average per 
bulge residue distance length for all H3S4>x>iH3So junc- 
tions within our database. The computed distribution is 
roughly Gaussian with a mean of 5.78 A and a standard 
deviation of 1.48 A (Supplementary Figures S2 and S3). In 
the new calculations, we use a cutoff of 7.26 A per bulged 
nucleotide, or the mean plus one standard deviation, 
which accommodates 84% of the per-nucleotide bulge 
lengths found within the PDB. With the distance cutoff. 



the connectivity constraints alone exclude 95, 83, 62 and 
31% of the possible orientations for Si So, S2S0, S3S0 and 
S4S0 bulges, respectively (Figure 2A). 

Our analysis does not account for intra-linker and 
Unker-helix steric clashes. As a first estimate of the impact 
of such steric effects, we identified orientations where the 
line segment drawn between the 03' and P atoms (the 
linker termini) was <2A from either helix and within 
2 A of the cutoff distance, meaning a bulge connecting 
the two termini would have no alternative path. This 
analysis suggests that 0%, 1%, 2% and 4% of the con- 
nectivity allowed conformations may be inaccessible due 
Unker-helix steric overlaps for Si So, S2S0, S3S0 and S4S0 
motifs, respectively. When considering both steric and 
connectivity constraints, as discussed in greater detail in 
following sections, 0%, 2%, 8% and 11% of the SiSo, 
S2S0, S3S0 and S4S0 allowed conformations are estimated 
to contain linker-helix steric overlaps. 

The connectivity constraints, and their variation with 
bulge length, depend critically on the ratio of the per- 
nucleotide length of the bulge, which is determined by 
the backbone structure, and to the helix diameter, as 
determined by base-pair structure. In Figure 2B, we plot 
the results of a derived mathematical relationship 
(Supplementary Figure S4) that relates the set of accessible 
tth Ph Jh angles to the number of bulged nucleotides and 
the 'helix' diameter. At the effective diameter of the 
A-form helix, increasing the number of bulged nucleotides 
results in incremental increases in the size of the allowed 



Nucleic Acids Research, 2012, Vol 40, No. 2 897 



space up to 5nt, above which there are no connectivity 
constraints. However, for a hypothetically smaller helix 
diameter, there are fewer, more coarsely spaced putative 
levels. Conversely, there are greater, more finely spaced 
levels for larger helix diameters, underlining the unique 
tuneabihty of RNA junction conformation that is 
endowed by RNA's chemical structure. 

A strong prediction of Figure 2B as well as the com- 
puted topologically allowed space is that while the access- 
ible space increases with bulge length, it plateaus at 5nt. 
This prediction is in general agreement with the FRET 
experiments of Gohlke et al. (34) where the FRET effi- 
ciency between probes located at the ends of the two 
helices increased as the number bulged nucleotides in- 
creases from one to five but then changes very little for 
bulges that are 7 and 9nt long. Additionally, gel elec- 
trophoresis experiments also indicate a convergence to a 
common electrophoretic mobihty for bulges that are 
longer than 6nt (14,18,35). Interestingly, a search of 
bulge motifs in PDB with the RNA FRABASE (28) 
reveals a marked scarcity of bulges with >5nt (Figure 2C). 
This suggests that bulges primarily play the role of tuning 
the inter-helical orientation and there is rarely reason to 
have bulges >5nt long. 



Inter-helical correlations. A notable feature of Figure 2A 
is that the inter-helical angles do not vary independently 
of one another; rather we observe two linear correlations, 
one positive and one anti, in the a^-jh projection. Thus, 
each helix can only twist about its axis a certain amount, 
generally determined by the length of the hnker, before the 
other helix must compensate with a correlated twist or 
bend to maintain linker connectivity (Supplementary 
Figure S5). This codependence can be attributed to the 
unique topology of the nucleic acid two-way junction 
where the helices are linked at two positions around the 
helix circumference. By comparison, an artificial system in 
which helices are connected through a single, centrally 
located linker exhibits no correlations between ah and Yh 
(Supplementary Figure SID). 

The source of this correlation can be found in the math- 
ematical relationship hnking the hnker length, L, to the 
fixed helix diameter, D, and the apphed Yh rotation 
transformation (Supplementary Figure S4, Supplementary 
Information): 
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Figure 2. Connectivity allowed inter-helical conformations. (A) The set of conformations allowed given the connectivity constraints of 1 (light gray), 
2 (dark gray) and 3 (black) bulged nucleotides, where the allowed conformations for / number of bulged nucleotides include those for i—2, . . .1 
bulged nucleotides. (B) The percentage of ffj, /Sj, yi, orientations that are connectivity allowed as a function of effective helix diameter for a linker 
cutoff of 1 (black), 2 (red), 3 (green), 4 (blue), 5 (magenta) or 6 (orange) bulged nucleotides. The effective linker diameter of an A-forin helix, 17.5 A, 
is marked with a vertical dashed line. (C) The number of SiSo (black), S2S0 (red), S3S0 (green), S4S0 (blue), S5S0 (magenta) and S>(,So (orange) 
structures in PDB. 
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A derivative of Equation (1) with respect to an arbitrary 
time, t, while maintaining a constant inter-hehcal bend, /^h. 
and constant hnker length, shows that changes in ah must 
be offset by corresponding changes in yh through the 
equality 



da/, _ sin(o!h)cos(yh)+cos(Q!h)cos(j6h)sin(yiO dyi, 
dt — cos(Q;h) sin()4i) — sin(ah) cos(y6h) cos(>'h) d; 



(2) 



highlighting the interdependence of these quantities. In the 
proper limits. Equation (2) reduces to 



doh 
d? 



: ± 



dKh 
dt 



(3) 



and changes in which denotes a twist of the 3' helix, 
must be exactly offset corresponding changes in Yh. or 
twist of the 5' helix, and explaining the positive and 
negative ah-yh correlations in Figure 2A. 

Generally, we do not expect /ih and hnker length to 
remain fixed; rather all variables will be sampling values 
throughout their allowed range. Under this scenario, ah 
and Yh are no longer directly correlated as a change in ah 
or Yh could be offset by a change in the hnker length or /jh- 
However, the limited range of the linker length and a 
limited range due to steric constraints ensure that the 
system will experience the ah-yh constraints, giving rise to 
on-average correlated movements of the two helices as has 
been previously observed from experiment (23,25). These 
correlations will be particularly strong in junctions with 
smaller bulges where the range of accessible hnker lengths 
and ;3h's is more significantly restricted. 



Union of connectivity and sterics. The full range of access- 
ible inter-helical conformations is restrained by both con- 
nectivity and steric constraints. In Figure 3A, we show the 
intersection of both of the spaces. Visual comparison 
between Figures 2A and 3A reveals that sterics and con- 
nectivity exclude distinct parts of conformational space. 
For the case of SiSo and S2S0 bulges, 95% and 83% of 
the space is disallowed when considering connectivity 
alone, yet when combined with the steric constraints, 
98% and 95%, respectively, of the space is excluded. 
While the connectivity constraints are symmetric, this 
symmetry is broken through the imposition of the steric 
constraints. Conspicuously, the negative ah-yh correlation 
for the Si So and S2S0 bulges disappears and the range of 
accessible /Sh angles decreases; this implies that bends that 
maintain or shorten the distance between the two closing 
base pairs, and therefore preserve a short linker length, are 
precluded due to steric collisions, whereas bends that 
result in an opening of the space between the pairs, and 
are accessible to bulges with longer hnker lengths, are not. 

Nevertheless, this picture of the topological constraints 
is incomplete — A-form helices themselves possess intrinsic 
orientational degrees freedom that are independent of the 
constraints imposed by the bulge. Analysis of aU coaxially 
stacked 2 x 3-bp helices, or all continuous 6-bp helices, in 
the PDB revealed that the ah Yh values for these systems 
varied with -20 < (ah + yh) < 20 and -25<ySh<35 
(Supplementary Figure S6). To obtain a complete topo- 
logically aUowed space we take the union of those con- 
formations that satisfy either these constraints or those 
calculated for the bulge topology of interest with an 
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Figure 3. Union of sterics and connectivity constraints. (A) Plane projections of the set of conformations satisfying both the steric and connectivity 
constraints for 1 (hght gray), 2 (dark gray) or 3 (black) bulged nucleotides, where allowed conformations for / number of bulged nucleotides are 
inclusive of those for i—2, . . .1 bulged nucleotides. (B) Projections of the total topologically allowed space, obtained by taking the union of that 
in (A) with those conformations accessible to continuous helices and an additional 5° error. Inter-helical orientations measured for 728 H3S1H3S0, 
160 H3S2H3S0, 65 H3S3H3S0 and 10 H3S4H3S0 crystal structure junctions in the PDB are plotted in red, green, blue and orange, respectively. 
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additional 5° in each angle added to the bounds of this 
space. This 5° padding accounts for the errors inherent in 
the measurement of an arbitrary ah /Sj, Yh angle from the 
PDB as well as for capturing conformational states that 
may be accessible due to deviations from idealized A-form 
hehcal structure. 

To test our framework we measured how well we 
could capture the conformations found in the PDB. In 
Figure 3B, we plot the predicted aUowed conformations 
overlaid with conformations measured for the 963 
non-GU closing base pair crystal structure junctions in 
our database. Table 1 summarizes the results for the 
both the fraction of PDB structures that fall within the 
predicted idealized allowed space and the fraction of 
possible tth Ph Yh states that the space is confined to. It 
is remarkable that our model samples as little as 5% of the 
allowed space, and on average 18%, and yet is able to 
accommodate 100% of all known bulge structures. 
These results compare favorably to our prior study (23) 
where the topologically allowed space was confined to 4%, 
8%, 14% and 20% of possible states for SiSo, S2S0, S3S0 
and S4S0 motifs, respectively, accommodating ~84% of 
1195 cryo-EM, NMR, and crystal structure junctions. 
The differences in the degree of confinement for the 
larger bulge sizes are a function of the longer linker 
length used in the current model (7.26 A versus 4.9 A per 
nucleotide). 

While we have focused on X-ray structures in the above 
analysis, 100% and 97% of the 73 cryo-EM and 58 NMR 
junctions without GU closing base pairs, respectively, fall 
within the predicted region (Supplementary Figure S7). 
The two NMR outliers [(1A9L,2A9L),36] are from the 
same set of experimental restraints and have aj, ySh Th 
angles 12° and 16° away from the closest predicted 
allowed point. A MolProbity (37) analysis of the struc- 
tures revealed that there are significant steric clashes and 
deviations from accepted stereochemistry at the site of the 
S3S0 junction in question, suggesting that structural 
inaccuracies may be the cause of this disagreement. We 
also excluded junctions containing GU closing base pairs 
due to their propensity to induce significant local distor- 
tions to the base pairing geometry. Stand-alone analysis of 



Table 1. Fraction of crystal structures that fall within the predicted 
topologically allowed space for non-GU closing base pair crystallo- 
graphic RNA junctions identified with the RNA FRABASE (28) and 
fraction of total unique ah |6h Yh permutations allowed for the 
idealized, non-ideal and idealized with translation helical systems 



Bulge 


Fraction 


Idealized 


Range of 


Translation 


type 


of PDBS 


o-h |8h Th 


non-ideal 


"h |8h Yh 




sampled 


allowed 


"h /3h Th 


allowed 




(total) 




allowed (mean) 


(w/o 5° Err) 


SiSo 


1.0 (728) 


0.048 


0.047-0.055 (0.049) 


0.062 (0.028) 


S2S0 


1.0 (160) 


0.094 


0.090-0.10 (0.098) 


0.14 (0.093) 


S3S0 


1.0 (65) 


0.19 


0.20-0.24 (0.22) 


0.33 (0.23) 


S4S0 


1.0 (10) 


0.37 


0.38-0.45 (0.42) 


0.61 (0.49) 



Topologically allowed spaces represent the intersection of the sterically 
and connectivity allowed spaces with the addition of the ah Ph Th ac- 
cessible to a continuous helix and 5° error padding in each angle. 



all 899 GU-containing junctions showed that 87% fall 
within our predicted ranges, where those falhng outside 
are on average 9° from the nearest predicted point 
(Supplementary Figure S7). This suggests that sequence 
induced helix distortions can play a role in modifying 
the topologically allowed space. Indeed, 100% of the junc- 
tions containing GU pairs fall within at least one of two 
allowed distributions generated from non-idealized GU 
closing base pair sequences. However, it should be noted 
that comparison between these distributions and the 
idealized derived distribution share the majority of their 
points. 

An interesting observation of Figure 3B and 
Supplementary Figure S7 is that certain regions of the 
allowed space are rarely, if ever, sampled in the PDB. 
This includes conformations where helices bend into 
each other along the major-groove face, which likely 
leads to unfavorable electrostatic effects. Further refine- 
ment of the model with incorporation of electrostatic 
interactions and larger van der Waals radh, as mentioned 
above, would likely reveal that such states are largely 
inaccessible. 

Dynamic properties of the topologically allowed 
space. Prior studies of two-way junction motifs suggest 
that these systems are highly dynamic on the nanosecond 
to microsecond timescales (25,38), and in the case of 
HIV-1 TAR (23,39), roughly sample the entire topologic- 
ally aUowed space. If bulge systeins are in fact samphng 
the entire topologically allowed space one would expect an 
ensemble average over aU conformations to be similar to 
that observed from experiments with sensitivities less than 
these timescales. Remarkably, if we assume each conform- 
ation is equally weighted and calculate the ensemble 
average bend, or expected from the topologically 
allowed space, we obtain a result that closely replicates 
the values Zacharias and Hagerman (18) obtained for 
polyA and polyU bulges using transient electronic bi- 
refringence (Figure 4A). These values are also in agree- 
ment with estimates from FRET measurements of RNA 
of Gohlke et al. (34) and in qualitative agreement with 
studies performed on DNA (34,40,41). A corollary to 
these results is that the bent state is entropically favored 
for longer loops. Disregarding the conformational prefer- 
ences of the loop, there is a significant entropic penalty to 
forming bent states (|/6h| > ~60) for Si So and S2S0 bulges, 
while the opposite holds true for Sx>3So bulges, where 
forming a coaxial stack (restricting |/ih|<~35) is dis- 
favored (Figure 4B). Thus, the varying conformational 
preferences of different junction topologies can be 
attributed to secondary structure encoded modulation of 
the topologically allowed space. 

Internal Loops 

Structural survey of internal loop conformations. In 
previous work. Bailor et al. (23) showed that the behavior 
of asymmetric internal loops can be explained by trans- 
forming a given internal loop into a reduced bulge repre- 
sentation. This is achieved by maxiinizing stacking and 
non-canonical base pairing between residues in the internal 
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N ipjn 

Figure 4. (A) The inter-helical bend angle for N bulged nucleotides for poly-A (dashed black) and poly-U (dashed red) RNA bulges as measured by 
Zacharias and Hagerman (18) in the absence of Mg''^, and averaged over the total topologically allowed space as a function of N bulged 
nucleotides. (B) Populations of topologically accessible conformations as a function of |/ihl for SiSo (black), S2S0 (red), S3S0 (green), S4S0 (blue) and 
S5S0 (orange) bulges. The inset shows the fraction of structures from the PDB with coaxial (|/ihl < 35°), semi-bent (35° < < 60°) or bent 
(IPhI > 60°) conformations for 1253 SjSo (black), 487 S2S0 (red), 238 S3S0 (green), 15 S4S0 (blue) and 14 S5S0 (orange) junctions. 



loop. This 'reduced' bulge motif is subject to the charac- 
teristic topological constraints, but with the addition of Y 
base pairs inserted into the junction. By assuming that 
each inserted base-pair results in a change in 
«ii + yh = —34°, the twist of the A-form helix, one could 
readily generate the allowed space for asymmetric internal 
loops by shifting the ah and yh of the reduced bulge set 
of topologically allowed conformations by 7*- 17°. This 
simple model was able to explain the observation that 
the Ph Yh measured for asymmetric internal loops 
mirrored the distribution found in bulges with systematic 
shifts in och and yh- 

To investigate this assumption of maximum stacking of 
internal loop residues, we used the FR3D (30) nucleic acid 
structure analysis software to probe the conformation of 
nucleotides inside internal loops. Analysis of our PDB 
hbrary consisting of crystal structures for all SiSi, S2S1, 
S3S1, S4S1, S2S2, S3S2, S4S2, S3S3, S4S3 and S4S4 junctions 
obtained from the RNA FRABASE (28) reveals a clear 
preference for bases in the Y strand to loop into the junc- 
tion (Figure 5). Moreover, in >90% of the cases where all 
Y bases are looped in, at least an equivalent number of X 
strand bases are looped-in as well (filled bars of Figure 5). 
Interestingly, the majority of the loops that do not adopt 
fully looped-in conformations are directly involved in ter- 
tiary interactions (see 'Methods' section). Manual exam- 
ination of the structures that were not fully looped-in and 
did not have a tertiary contact revealed that all were either 
influenced by crystal contacts or appeared to be missing 
coordinates where a protein would be located. 

As hypothesized from the observations of Bailor et al. 
(23), we expected that not only would the junctions adopt 
fully looped-in conformations, but also the looped in 
bases should form non-canonical base pairs that propa- 
gate the flanking helices. To examine what fraction of 
looped in bases form non-canonical base pairs we hand 
annotated all S2S2, S3S2 and S4S2 junctions. The results 
revealed that >80% of the junctions exhibited a maximum 



number of non-canonical base pairs or near non-canonical 
base pairs, which we define as cases when bases from 
opposing strands were looped into the junction and 
roughly coplanar but did not appear to form clear hydro- 
gen bonds (Figure 5B). Results for the fraction of bases 
looped-in obtained from this annotation closely resemble 
those from FR3D (data not shown). Comparable to the 
results of Figure 5A, the fraction of structures with two 
non-canonical or near non-canonical pairs increases to 
97% when we only consider structures without tertiary 
interactions. Of the four outliers where we do not 
observe maximal pairing and which do not have tertiary 
interactions, two show maximal pairing in other structures 
reported of the same junction. Of the remaining two, one 
is located at the surface of the ribosome and we believe, 
given the level of disorder of the junction nucleotides, may 
be experiencing the influence of tertiary interactions not 
reported in the coordinate file, and the other is bound in 
close proximity by a protein and under considerable 
tensile stress. We also note that when removing junctions 
with tertiary interactions only three H3S4H3S2 junctions 
remain, one of which is a candidate for having unreported 
tertiary interactions, making it impossible to draw conclu- 
sive findings for this particular motif 

Internal loop topologically allowed space. The results from 
our structural survey motivate the assumption that the 
junction residues will fully loop in and maximize non- 
canonical base pairs. Thus, the topologically allowed space 
for a SxSy motif is identical to that for a Sx-ySo motif 
shifted by Y*-\l° in ah and Yh to account for the addition- 
al twist of F non-canonical pairs. In the case of symmetric 
internal loops, SxSx motifs reduce to SqSo motifs and are 
subject to the same constraints on ah y6h Yh that we ob- 
served for continuous helices in the PDB (Supplementary 
Figure S5). With the correction for the additional twist of 
— 17° in both ah and Yh per non-canonical base pairs, the 
SxSx allowed space contains all ah ySh Yii satisfying 
-25 < Al < 35 and -20-34*l'< (ah + Yh) < 20-34*1- on a 
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Figure 5. (A) Fraction of looped-in Y strand bases determined from FR3D (30) analysis of internal loops for H3Si-=;iHsSy=i, H3Sjt->2H5SY = 2, 
H3SA->iHsSr=3, and H3SA-=4HsSr=4 junctions. Black bars represent all structures within our database for which FR3D analysis files were available 
and gray represent the subset of these structures identified as not having tertiary interactions. The degree to which the rightmost bars (fully looped-in 
conformations) are filled represents the fraction of those structures that have at least Y number of looped-in bases on the X strand. Results represent 
474 [135] S|S,, 468 [97] SjS,, 194 [1] S3S,, 168 [6] S4S1, 315 [103] S2S2, 341 [77] S3S2, 23 [4] S4S2, 631 [74] S3S3, 165 [32] S4S3 and 153 [5] S4S4 motifs 
where [] denotes the number of structures without tertiary interactions. (B) The number of non-canonical and near non-canonical base pairs observed 
in all H3Sx>2HsS2 crystal structures obtained through a manual annotation of structures. The fraction of structures with 0, 1 or 2 pairs is plotted for 
S2S2 (solid), S3S2 (dashed) and S4S2 (open) motifs in black, with the subset of those structures free of tertiary contacts shown in gray. There are 323 
[97], 349 [46] and 23 [3] S2S2, S3S2, S4S2 motifs, respectively, where [] denotes the number of motifs without tertiary contacts. 



5°-grid, with an additional 5° in each dimension to 
account for ahgnment errors and translational degrees of 
freedom. 

As shown in Table 2, this model is able to capture the 
significant majority of conformations observed within the 
PDB. When tertiary interactions are excluded, the fraction 
of PDB conformations sampled is largely 100%, where the 
exceptions are crystal structures that have crystal packing 
interactions at the junction, or are very disordered junc- 
tions that appear to be missing protein or other coordin- 
ates in close proximity to the junction site. 

Examination of the tertiary contact free cryo-EM 
(cEJM) internal loop junctions shows that 89% of 75 junc- 
tions fall within our predicted region. Of the eight outliers, 
seven are from a de novo cEM-docked model of the ribo- 
some built before a complete crystal structure was avail- 
able [(1C2W),42], and the other is a cEM-docked 
homology model of an rRNA segment whose full crys- 
tal structure possesses extensive protein interactions 
[(2G05),43]. Analysis of 83 tertiary contact-free NMR 
internal loops revealed that 85% of the NMR structures 
agree with our predictions, where outliers were on average 
~10° from the nearest allowed conformation. This 
decrease in performance is a result of outlier junctions 
adopting conformations that are not fully non-canonically 
or near-non-canonically paired, or, when paired, pro- 
pagating with the helical twist that is significantly different 
than —34°. While it is certainly possible that the more 
flexible RNAs characterized through NMR adopt con- 
formations not in agreement with our predictions, we 
note that 10 of the 12 outliers, and afl of those >10° 



Table 2. The fraction of internal loop motifs that fall within the pre- 
dicted topologically allowed space, for all crystallographic junctions 
identified with the RNA FRABASE (28), the subset of those junc- 
tions without tertiary interactions, and the fraction of the total 
3!h Ph Th conformations predicted as allowed 



Junction 


PDBs 


PDBs 




type 


sampled 


w/o tert. 


allowed 




(total) 


contacts 








sampled 








(total) 




s,s, 


0.99 (482) 


1.00 (135) 


0.041 


S-,S2 


0.79 (323) 


0.99 (103) 


0.041 


S3S3 


0.94 (641) 


1.00 (74) 


0.040 


S4S4 


0.97 (155) 


0.60 (5) 


0.039 




1.00 (482) 


1.00 (97) 


0.048 


S3S2 


0.99 (349) 


0.96 (77) 


0.047 


S4S3 


0.26 (165) 


0.56 (32) 


0.046 


S3S, 


0.95 (198) 


1.00 (1) 


0.093 


S4S2 


1.00 (23) 


1.00 (4) 


0.092 


S4S1 


1.00 (170) 


1.00 (6) 


0.19 



outside, are structures solved without the use of residual 
dipolar coupfing (RDC) restraints. As recently shown by 
Summers and colleagues (44) these structures are particu- 
larly reliant on the force field (and any deficiencies found 
therein) used in the structure refinement. That these 
outliers may be due to structural inaccuracies is also sup- 
ported by MolProbity (37) analysis showing that each 
outlier suffers from steric violations and/or deviations 
from standard backbone angle stereochemistry around 
the site of the junction (data not shown). Finally, we 
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note that analysis of the entire structural ensemble of each 
outlier showed that, expect for one molecule [(1TLR),45], 
at least several, and on average ~40%, of the reported 
structures feU within the allowed space, demonstrating 
that experimental constraints are not precluding agree- 
ment with our model. 

We note that we previously approximated the symmet- 
ric loop case using the set of allowed conformations for a 
Si So bulge system that was shifted by —17° in and —17° 
in Yh per looped base (23). This approximation achieves 
similar results as those above, but fails to provide a 
physical model for its success. The reason for this agree- 
ment is because of significant overlap between the two 
allowed distributions — ~50% of the points are the same. 

Combined, the results of our structural surveys and the 
success of our internal loop predicted topologically 
ahowed space strongly argue for a 'reduced' bulge model 
of internal loops in the absence of external tertiary con- 
tacts. This view represents a significant simplification in 
the hnk between secondary and tertiary structure and 
suggests an additional level of constraints that may be 
incorporated in modehng RNA 3D structure from second- 
ary structure (22). In particular, these constraints could be 
implemented in the de novo prediction of smaller RNAs 
(46,47) that do not possess long-range tertiary contacts, 
significantly reducing the conformational space that must 
be searched and thereby limiting computation time. The 
topologically allowed space might also be useful as a 
global orientation restraint in the refinement of NMR 
structures. Finally, in larger RNAs where tertiary 
contacts are expected to be present, such constraints 
could be implemented as a function of the proximity of 
sequence-distant RNA to the internal loop in question; on 
close approach, the constraints would be relaxed, allowing 
greater conformational freedom. 

Interplay between tertiary interactions and topologically 
allowed conformations. Recent studies (24) have put 
forth the idea that topological constraints can act to pref- 
erentially select for the formation certain tertiary contacts 
over others, thereby providing a mechanism for secondary 
structure control of RNA folding and 3D structure. 
Intriguingly, our results suggest that there may be another 
dimension to the relationship between the secondary 
structure encoded conformational space and tertiary inter- 
actions. An implication of Table 2 is that tertiary inter- 
actions can modify the topologically allowed space by 
looping out junction residues from their default, maximal- 
ly non-canonically paired conformation. Thus, tertiary 
interactions can affect the global conformation of RNA 
by modifying the topological constraints encoded by junc- 
tions. Several examples of tertiary contacts significantly 
modifying the conformation of an internal loop can be 
found in the hterature. Most dramatically, the S2S2 
motif of the Lll protein binding site in the 23S rRNA 
shifts from a fully non-canonically paired conformation 
to one where a base distal in sequence from the junction 
inserts to the middle of the junction and is stabilized 
by interactions with Lll (48,49), with similar behavior 
observed for closely related mutants (50). In the process, 
the junction is shifted from within the topologically 



allowed space to >45° outside. Also from the ribosome, 
the UAA/GAN internal loop is significantly perturbed 
from its fully non-canonically paired conformation in 
the free state when bound by proteins (51-53). Examples 
of RNA tertiary contacts modifying internal loop con- 
formation can be found in the GAAA receptor of 
Tetrahymena ribozyme P4-P6 domain (45,54), and stem 
A and stem B of the hairpin ribozyme (55-57). In some 
cases, downstream structure may also act to stabihze tran- 
sient deviations from what would otherwise be a fully 
looped-in internal loop conformation, consistent with 
behavior of the J5/5a hinge of the Tetrahymena 
ribozyme in its folded and unfolded states (54,58-60). It 
is well known that internal loops serve as important sites 
in protein binding and intra-RNA tertiary interactions 
(61-64), and that their sequence-specific ability to recog- 
nize their binding partners plays a crucial role in 
stabilizing specific RNA 3D folds (62,65,66). However, 
that these tertiary interactions, once formed, may in fact 
be also enabling the molecule to sample new, otherwise 
inaccessible inter-helical conformations could offer a new 
paradigm to understanding RNA folding. 



ACCESSIBILITY 

Files containing the predicted topologically aUowed spaces 
presented above may be obtained in plain text file format 
at: http://hashimi.biop.lsa.umich.edu/sites/default/files/ 
topallowedspace.tgz. 



SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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